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Abstract 

Beautiful formulas are known for the expected cost of random two-dimensional assignment 
problems, but in higher dimensions, even the scaling is not known. In 3 dimensions and above, 
■ the problem has natural "planar" and "axial" versions, both of which are NP-hard. For 3- 



dimensional Planar random assignment instances of size n, the cost scales as 0(l/n), and a main 
result of the present paper is the first polynomial-time algorithm that, with high probability, 
finds a solution of cost 0(n~ 1+e ), for arbitrary positive e (or indeed e going slowly to 0). For 
3-dimensional Axial assignment, the lower bound is fi(n), an d we give a new efficient matching- 
| based algorithm that returns a solution with expected cost 0(n log n). 

1 Introduction 

An instance of the (two dimensional) assignment problem may be thought of as an n x n cost 
array dj, a candidate solution is a permutation 7r: [n] i-» [n], its cost is Yli=l ^iMi) > anc ^ an 
optimal solution is one minimizing the cost. If the cost matrix represents, for example, the costs 
of assigning various jobs i to machines j, where each machine can accommodate only one job, 
| then the problem's solution represents the cheapest way of assigning the jobs to machines. It may 

equivalently be formulated as an integer linear program, minimizing the sum of selected elements 
consistent with the selection of exactly one element from each row and from each column, i.e., 
minimizing Ylij where Xjj € {0,1}, (Vi) Yyj = 1 and (Yi) T,j X ij = 1- This is a 

network flow problem, thus its linear relaxation with X{ j £ [0, 1] has integer extreme points, and 
the problem may be solved in polynomial time. 

The random assignment problem, in its most popular form, is the case when the entries of the 
cost matrix C are i.i.d. Exp(l) random variables (independent, identically distributed exponential 
random variables with parameter 1). Since anyway the problem can be solved in polynomial time, 
the focus for the random case is on the cost's expectation as a function of n, 

n 

f(n) = E min ^ Cj j7r(i ) = E min ^ XijC id 
i=i 1,3 i,j 

with Xi j € {0, 1} subject to the row and column constraints as above. This problem has received a 
great deal of study over several decades. It was considered from an operations research perspective 
in the 1960s [Don69], an asymptotic conjecture f(n) — > 7r 2 /6 = ((2) was formulated by statistical 
physicists Mezard and Parisi in the 1980s based on the mathematically sophisticated but non- 
rigorous "replica method" [MP85, MP87], an exact conjecture f(n) = Y^=i V* 2 was hazarded by 
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Parisi in the late 1990s [Par98] and a generalization to partial matchings and non-square matrices 
made by Coppersmith and Sorkin [CS99], the Mezard-Parisi conjecture was proved by Aldous in 
a pair of papers in 1992 and 2001 [Ald92, AldOl], the Coppersmith-Sorkin conjecture was proved 
simultaneously in 2004 by two papers using two very different methods [NPS05, LW04], and the 
study of other aspects of the random assignment problem and related problems is ongoing, for 
example by Wastlund in [Was09]. 

In higher dimensions there are two natural generalizations of the assignment problem. For 
example in three dimensions, the Planar assignment problem is, given an n x n x n matrix (or 
"tensor" or "array") C, to find a solution Xij^ minimizing J2i j k-^i,j,k Ci,j,k where X^j^ G {0, 1} 
and there is one selected value per "plane" of the array: 

Equivalently, it is to determine min^ Y2?=i Q,7r(i),o-(i)j the minimum taken over a pair of permuta- 
tions. The Axial three dimensional assignment problem is similar but with one selected value per 
"line" of the array: 

(Vi, j) = i ; (Vj , k) £ .V,,,, = 1, (Vi,fc) J2 X id,k = i- 

k i j 

The generalizations to higher dimensions are clear. In 3 dimensions and higher, the Planar and 
Axial assignment problems are both NP-hard. The Planar case d = 3 was one of the original 
problems listed by Karp [Kar72]. The complexity of the Axial problem was established by Frieze 
[Fri83]. 

The multidimensional random assignment problem we consider here is the case when the entries 
of the cost matrix are i.i.d. Exp(l)random variables. In this random setting, there are two natural 
questions. First, are there polynomial-time algorithms that find optimal or near-optimal solutions 
whp? Second, what is the expected cost of a minimum assignment? A random two-dimensional 
assignment instance has limiting expected cost ((2), and Frieze showed that the expected cost 
of a minimum spanning tree in the complete graph with random Exp(l) edge weights tends to 
C(3) [Fri85], so it is tantalizing to wonder if there might be similarly beautiful expressions for the 
expected cost in multi-dimensional versions of the random assignment problem. However, we do 
not even know how the cost scales with n. 

Some of the characteristics and applications of these problems are discussed in a recent book by 
Burkard, Dell'Amico and Martello [BDM09]. Very little is known about the probabilistic behavior 
of the minimum Z^ n of C(T) for d > 3, and even less is known about polynomial time algorithms 
for constructing good solutions. Grundel, Oliveira, Pasiliao and Pardalos [GOPP05] show that 
Z? — >■ whp in this case. Statistical physicists have conjectures based on "cavity" calculations 
[MMR04, MMR05], but there is no such nice constant as C(2), and no certainty even that the 
conjectured scaling is correct. 

2 Summary of results, methods, and limitations 
2.1 Planar assignment 

For the Planar d-dimensional assignment problem, there is an easy lower bound of Q(l/n d ~ 2 ) on 
the expected cost (see Theorem 1). Our main result (Theorem 2) is for the case d = 3. Here we 
give an algorithm that for any constant e > runs in time polynomial in n and yields a solution 
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of expected cost 0(l/n 1_£ ), and where e may be taken slowly to for a "mildly exponential time" 
algorithm yielding an approximation in the average case. Not only is this the first nearly tight 
upper bound obtained algorithmically, it is the only such bound except for one (see Theorem 1) 
following from a recent non-constructive result on hypergraph factors by Johansson, Kahn and Vu 
[JKV08]. 

Our algorithm may be thought of as an extension of one in [CS99] for 2-dimensional assignment. 
In that case, a bipartite matching was augmented by an alternating path of bounded length, 
with care taken to in regard to "conditioning" of the cost matrix. Here, partial assignments are 
augmented with a "bounded depth alternating path tree" , a tree in which a newly added element 
displaces two previously selected elements, those two elements are replaced in a way displacing 
four selected elements, and so on, until all the displaced elements are replaced by elements in a 
non-conflicting, "unassigned" set (see Figure 2.1). 
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Figure 1: Diagram of alternating-path tree; see Section 3.3.2. Adding new first coordinate i to the 
partial assignment using hyperedge (i,j, k) implies deletion of previous assignment elements 
and (k,k,k), with first coordinates j and k then reassigned respectively to elements (j,p,q) and 
(k,r,s), displacing four more existing assignment elements (p,p,p) etc., whose first coordinates are 
finally reassigned to unused second and third coordinates as (p, 61,62) etc. 

Given the tree's expansion it is surprising that this approach works, but for d = 3 it does. 
However, for d > 3, a more general approach fails. For any d > 2, our algorithm iterates a 
restricted form of local augmentation. (The main technical aspect of our result is restricting the 
augmentation so as to control the conditioning of the random cost matrix.) The last iteration 
replaces k selected assignment elements, spanning a submatrix that without loss of generality can 
be thought of as [k] d , with k+1 elements spanning a sumbmatrix [fc+1] (i-e., containing the smaller 
submatrix). When /c is a constant, the probability that there is any new such assignment of cost 
< w (even without the restrictions our algorithm imposes) is at most (/!)(&!) d_1 " wk = 0((nw) k ), 
so for the algorithm to have any hope of succeeding, for the last iteration alone we must budget 
a cost contribution w = f2(ra -1 ). That proves to be satisfactory (even taking all the iterations 
into account) for 2-dimensional assignment where the total cost we are aiming for is 0(1), and 
for 3-dimensional assignment with its lower bound of f2(re _1 ), but it is plainly unsatisfactory for 
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dimension 4, where the lower bound (which we imagine is close to the truth) is n(n~^ d ~ 2 ^) = ^(n~ 2 ). 

2.2 Axial assignment 

Our second main result (Theorem 3) is for Axial 3-dimensional assignment, and uses a greedy 
algorithm: once assignment elements are chosen, they are never replaced. The 3-dimensional 
assignment consists of n 2-dimensional assignments, with constraints between them. (Ignoring 
these constraints gives a relaxed 3-dimensional problem consisting of n 2-dimensional problems. Its 
expected minimum cost is n*^J- =l V* 2 ; proving that the true 3-dimensional cost has expectation 
f2(n).) The first assignment is chosen greedily (at expected cost J27=i l/^ 2 — 7I " 2 /6) from a complete 
bipartite graph K n ^ n . The second assignment is similar, but on a graph K n ^ n from which the edges 
of the first matching have been subtracted, so that each vertex has degree n — 2, and similarly for 
the subsequent matchings, down to the last one where each vertex has degree 1 and the matching 
is forced. 

The structure of the ith intermediate graph — K n , a minus a union of i — 1 edge-disjoint perfect 
matchings — is complex, but a general result of Dyer, Frieze, and McDiarmid [DFM86] is insensitive 
to the details and gives an upper bound of Inj (n— for the expectation of its minimum-cost per- 
fect matching, for a total expectation of 0{n log n), close to the £l{n) lower bound. Algorithmically 
there is no difficulty since we are just solving a set of minimum-cost assignment problems. 

As in the Planar case, though, our approach to the Axial problem falters for dimensions d > 4. 
Here, the 3-dimensional assignment problems are not necessarily regular, and by the end, there can 
even be 3-dimensional instances with no assignment at all: the greedy algorithm can fail. Perhaps 
this is not a problem in the average case, but at the least it necessitates a more careful analysis. 

2.3 Structure of the paper 

We deal with Planar assignment in Section 3, starting with easily proved non-constructive bounds 
(but where the upper bound relies on some heavy machinery). The next task is to analyze a 
"bounded depth alternating path tree algorithm" (BDAPTA) to prove Theorem 2. We analyze a 
three level version in Section 3.3, providing intuition. The general case is analyzed in Section 3.4, 
completing the proof. 

The Axial problem is considered in Section 4. The lower bound of Theorem 3 is proved in 
Section 4.2, and its upper bound in Section 4.3. 

3 Multi-Dimensional Planar Version 

3.1 Simple bounds 
Theorem 1 

Proof. Clearly 

n 

Z d,n >J2 m[n C iu-,id- 

i — ' »2,— >*d 
11=1 

Each term in the above sum is distributed as Exp(n rf_1 ) and so has expectation \jn d ~ x and variance 
l/n 2d ~ 2 . The Chebyshev inequality implies that the sum is concentrated around the mean. 
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For the upper bound we use a recent result of Johansson, Kahn and Vu [JKV08]. This implies 
that whp there is a solution that only uses <i-tuples of weight at most — . The upper bound 
follows immediately. It should be noted that their proof is non-constructive. □ 

3.2 Main theorem 

We fix the dimension to d = 3 for the rest of Section 3. 

Theorem 2 Suppose that 1 < k < 7log 2 logn where 7 is any constant strictly less than 1/2. Then, 
whp: 

(a) Algorithm BDAPTA(k) runs in time 0(n 2k+2 ). 

(b) The cost of the set of triples T output by BDAPTA(k) satisfies C(T) = 0(2 k n~ 1+9k logra), where 

®k = 2 k + 1 -i ■ 

We are not aware of any other polynomial time algorithm that will whp find a solution of value 
0(n~ 1+e ), for arbitrary positive e. 

3.3 Two Level Version of BDAPTA 

In this section we consider a two level version of the algorithm BDAPTA. In this way we hope that 
to make it easier to understand the general version that is described in Section 3.4. With reference 
to Theorem 2, the two-level version means taking k = 3, 9 = #3 = 1/7. 
The heuristic has three phases: 

3.3.1 Greedy Phase 

The first phase is a simple greedy procedure. 
Greedy Phase 

1. Let m = n - n 1 ' 6 , J = K = [n], and T = 0. 1 

2. For i = 1, . . . , n\ do the following: 

• Let Cjj-fc = min{Cij/ )jfc / : j' € J,k' G K}; 

• Add (i,j, k) to T and remove j from J and k from K. 

At the end of this procedure the triples in T provide a partial assignment. Let 

(i,j,k)eT 

Lemma 1 

2 

Z 1 < —-q whp. 

1 We will often pretend that some expressions are integer. Formally, we should round up or down but it will not 
matter. 
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Proof. We observe that if (i, j, k) 6 I then Cjj,jfc is the minimum of (n — z + l) 2 independent copies 
of Exp(l) and is therefore distributed as Exp((n — i + l) 2 ). Furthermore, the random variables 
(hJik) € T are independent. Using the facts that an Exp A random variable has mean 1/A 
and variance 1/ A 2 , 

- v i < r i+1 < 1 

'^(«- i + 1 ) 2 ~ Jx=i {n-x + lf- n^' 

Now 

1 = 1 

and the lemma follows from the Chebyshev inequality. □ 
3.3.2 Main Phase 

The aim of this phase is to increase the size of the partial assignment defined by T to n — 0(1). Let 
/ = I(T) be the set of first coordinates assigned in T, i.e., / = I(T) = {i : 3j, k s.t. k) € T}. 
Relabeling if necessary, without loss of generality we may assume that I = [\T\]. This phase will 
be split into rounds. We choose a small constant < a « 1 and let /3 = 1 — a. The aim of a 
round is to reduce the size of the set of unmatched first coordinates X(T) = [n] \ I(T) by a factor 
(3 while increasing the total cost of the matching only by an acceptably small amount. Thus we let 
x\ = n — n\ and xt = (3 t " 1 xi for t > 2. The aim of round t is to reduce |X(T)| from x% to xt+i- 
We continue this for to = ^°gi/p( x i/L) rounds where L is a large positive constant. Thus at the 
end of the Main Phase, if successful, we will have a partial assignment of size at least n — 2L. 

So suppose now that we are at the start of a round and that |X(T)| = x t . This is true for t = 1. 
Next let wq = 2n~ 12 / 7 logn and 



w t 



= 2n- G ' 7 x t 8/7 log 1 / 7 n for t > 1. 



At the start of each round we will refresh the array C with independent exponentials, at some cost. 
By this we mean that we replace C by a new array C where Cij^ < C[ ■ k + wt-i and the entries 
of C are i.i.d. Exp(l)random variables. More precisely, suppose that during the previous round 
we determined the precise values for all Cjj,fe < wt-i and left our state of knowledge for the other 
Cij : k as being at least wt-i- Then the memoryless property of exponentials means that 

c , _ \Ci,j,k ~ wt-i when C itj)k > w t -i 

l '^ k ) fresh ~ Exp(l) otherwise 

has the claimed property. Thus we can start a round with a fresh matrix of independent exponentials 
at the expense of adding another wt-i to each cost. We note also that we can whp carry out the 
Greedy Phase only looking at those Cij,jfc of value less than wq- 

Let Tt denote the value of T at the start of round t and let It = I(Tt),Xt = X{It). In round 
t we will add At = [n — Xt + 1, n — Xt+i] to It. By relabeling if necessary we will assume that at 
the start of round t we have T = {(i, : 1 < i < n — x t }. To add i £ A t to It we find distinct 
indices j, k,p, q,r,s G It (distinctness is not strictly necessary) and replace 6 of the triples in I t by 
7 new triples (see Figure 2.1: 

+ (i,j,k) - - (k,k,k) + (j,p,q) + (k,r,s) - (p,p,p) - (q,q,q) - (r,r,r) - (s,s,s)+ 

(P,6,6) + + (r,^ 5 ,&) + (1) 
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where £1, ■ ■ ■ ,£s are distinct members of X t , and each of the triples added in (1) is required to 
have (refreshed) cost at most wt- Roughly, we are assigning a new 1-coordinate i, this collides 
with previously used 2-coordinate j and 3-coordinate k, so the and (k,k,k) elements are 

removed from the existing assignment, 1-coordinates j and k are re-added as (j,p,q) and (k,r,s) 
thus colliding with the previous assignment elements (p,p,p), (q,q,q), (r,r,r), and (s,s,s), and 
finally 1-coordinates p, q, r, s are re-added as (p, £1, £2) etc., where the are elements not previously 
assigned. One may think of (1) as a binary tree version of an alternating-path construction; we 
will control the cost despite the tree's expansion. 

Putting Wt = wq + w\ + ■ ■ ■ + wt we see that if we can add one element to T at a cost of at most 
wt in refreshed costs, then in reality it costs us at most Wt; step (1) increases the cost by < 7Wt- 
Success in a round means doing this xt — xt+i times, in which case the additional cost of the Main 
Phase will be at most 7 times 

to to 

y^X x t - x t+iWt < xi(w + wi) + x t w t 

t=l t=2 

to 

< 3n" 6 / 7 log n + 2x~ 1/7 n^ 7 log 1 / 7 n ^ P~ t/7 < log n. (2) 

t=2 

We must now show that whp it is possible to add xt — xt+i = axt triples in round t with a 
(refreshed) cost of at most 7wt per triple. For this we fix t and drop the suffix t from all quantities 
that use it. We will treat refreshed costs as actual costs and drop the word "refreshed". 

We start by estimating the number of choices for assigning p. Ignoring other indices, the number 
of choices is distributed as the binomial Bm(u, 1 — e~ wx ) = Bin(z^, (1 — o(l))wx 2 ) where v = n — x. 

2 

Here 1 — e~ wx is the probability that for a given p, there exist ^1,^2 such that C Pt £ lt £ 2 < w. Note 
that 

wx 2 = 2(x/n) 6 / 7 log 1 / 7 n = o(l) and that wnx 2 2> logn 

and so the Chernoff bounds imply that, qs, 2 we can choose a set P of size exactly wnx 2 /2 = o(n), 
such that for each p G P there is at least one choice Ci;^2 £ X such that the triple (p, ^1,^2) is 
good, i.e., Cp ^ g 2 < w. Given this set of choices P we find that the number of choices for q £ P 
is distributed as the binomial Bin(^ — \P\, 1 — e~ wx ) and we can once again qs choose a set Q, 
disjoint from P such that \Q\ = wnx 2 /2 and each q € Q is in some good triple (#,£3, £4) where 
£3, £4 6 X. Similarly, we can choose sets R, S of choices for r, s, of size wnx 2 1 2, such that P, Q, R, S 
are pairwise disjoint. 

Observation 2 Each £ G X is in Bin(xu, 1 — e~ w ) good triples of the form (p € P, £',£") and so 
qs it is in at most 

4n 1 / 7 log 1 / 7 n 
2wnx = -7= 

such triples. 

We now discuss our choices for j and k. For a fixed j there are w 2 n 2 x A /4 pairs inPxQ and each 
has a probability 1 — e~ w of forming a good triple (j,p, q). Let j be useful if there is such a pair 
and useless otherwise. Then 

Pr(j is useless) < exp 



2 A sequence of events £ n , n > are said to occur quite surely, qs, if Pr(£„) = 1 — 0(n K ) for any constant K > 0. 



X V n 

wnx 



< 1 - 



wnx 
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It follows that the number of useful j £ Y = PUQU RU S dominates Bin(n — o(n), w 3 n 2 x 4 /5) and 
so qs we can choose a set J of useful j £ Y of size 

w 3 n 3 x 4 4n 3/7 x 4/7 log 3/7 n , , 
= 3 = ° {n) - 

We can by a similar argument choose a set K of useful k of this size disjoint from J and Y. 

Observation 3 A fixed p is in at most Bm(wn 2 x 2 /2, 1 — e~ w ) good triples (j,p,q) where (j,q) € 
J x Q and so qs every p is in at most w 2 n 2 x 2 such triples. 

Suppose then that in the middle of a round we have added y < ax triples to T. The number of 
£ € X that can be used in a good triple (p, £, rj) will have been reduced by y. The number of rj will 
have been reduced by the same amount. It follows from Observation 2 that the number of choices 
for p will have been reduced by at most 2ax x 2wnx. By Observation 3 this reduces the number 
of choices for j by at most 2ax x 2wnx x w 2 n 2 x 2 + 7ax <C | J\ = w 3 n 3 x 4 /6. The additional term 
+7ax accounts for the choices we lost because they have previously been used in this round. So 
our next i will get a choice of at least Bin((w; 3 n 3 x 4 /7) 2 , 1 — e~ w ) choices for a good triple k). 
So the expected number of choices is at least w 7 n G x s /49 = (2 7 /49)logn and then the probability 
there is no choice is o(n~ 1 ). This is sufficient to ensure that whp there is always at least one choice 
for every i. 

3.3.3 Final Phase 

We now have to add only 0(1) indices to /. At this point there is a problem with the bottom-up 
approach of the previous phase if x < 8, clearest in the case x = 1, say the single element n, when 
each of £i, . . . ,^8 would have to be n, leading to an illegal assignment. Thus instead we will work 
top down. The details of this will cause more conditioning of the matrix, and therefore we refresh 
C after each increase in /, at an extra cost of w = Kn^l 1 log 1 / 7 n. So, if successful, the cost of 
this round is O(Wt + w) = 0(n~ e / 7 log 1 / 7 n). 
Let us now replace the notation of (1) by 

+ (i,j,k)-(ji,j,h)-(k 1 ,k 2 ,k)+(j 1 ,p,q) + (k 1 ,r,s)-(p 1 ,p,p 3 )-(q 1 ,q 2 ,q)-(r 1 ,r, r 3 )-(si, s 2 , s)+ 

(pi,h,P3) + (<?i,<72, J3) + (n,S2,«3) + (s 1 ,k 2 ,r 3 ), (3) 

where any subtracted triple such as denotes a previous match (we are no longer assuming 

the convention that such a triple would be (j, j,j)), and where i 2 , 13 are unused 2- and 3-coordinates 
respectively. 

Fix j (and thus its previously matched companion indices and let Zj be the num- 

ber of choices for p,q (with their previously matched companion indices Pi,P3,qi,q 2 ) such that 
C(ji,p, q), C(pi, i 2 ,P3), C(qi, q 2 , js) < w. This has the distribution Bi(B 2 (n,w)B3(n,w),w) where 
Bi,B 2 ,Bz denote independent binomials, with B 2 counting the good choices for p, B3 those for 
q, and B\ those for j using these p and q possibilities. Using Chernoff bounds on the binomials 
B 2 ,Bz we see that whp Zj dominates B(n 2 w 2 /2, w) which dominates Be(n 2 w 3 /3), the Bernoulli 
random variable that is 1 with probability n 2 w 3 /3 and otherwise. The same holds for index k 
and (3) has been constructed so that choices for j, k are independent. So, the number of choices 
for j,k dominates Bin(n 2 , w(n 2 w 3 /3) 2 ) which has expectation f2(logn) and so is non-zero whp. 

This completes the analysis of BDAPTA when there are two levels. 
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3.4 General 3-Dimensional Version 



We follow the same three phase strategy. A; is a positive integer, 2 < k < 7 log log n. 
3.4.1 Greedy Phase 

This is much as before. Proceed as in Section 3.3.1 but taking = 9^ (recall 0's definition from 
Theorem 2) and defining n\ accordingly. Lemma 1 continues to hold. 



3.4.2 Main Phase 

Let 



a = 2 



-2fc-2 



(1-V273) 



and let /?, to and Xt, t = 1, . . . , to be defined as in Section 3.3.2. Let It, Xt,At have the same meaning 
as well. Now let wo = 2n~ 2 ( 1_6,fc ) logra and 

w t = 2x~ 1 ~ (lk n ek ~ 1 \og dk n for t > 1 

and 

W t = w + wi H h u/ t = O 



log 9fc n 



The aim of round t is once again to add xt — #t+i new indices to It using triples with (refreshed) 
cost at most wt- We will assume that at the start of round t we have T = {(i, : 1 < i < n — x t }. 
In analogy with (1), to add i G A t to I t we will add 2 k+l — 1 triples to T and remove 2 k+1 — 2 
triples, in which case the additional cost of the Main Phase will be at most 2 k+l — 1 times 

to to 

y^x x t - x t+i)w t < xi(wo + w^ + x t w t 

t=l t=2 

to 

< Zn 9 ^ 1 log n + 2x1 dk n 6k - x log 9 * n ^ ^ kt < An 8 ^ 1 log n. (4) 

t=2 

The notation used in (1) is obviously insufficient. We imagine a rooted tree T of triples. The root 
will be p = (io,jo, ko) where io is the index to be added to It- The root is at level zero. The triples 
at odd levels are to be deleted from T and the vertices at even levels are to be added to T. Every 
triple at an odd level 21 — 1 will therefore have the form (p,p,p) where p G If. This triple will have 
one child (p, a, b) which will replace the parent triple in 1-plane p. If I < k then a, b G It and if I = k 
then a, b G Xt- A triple u = (p, a, b) at an even level will have two children. By construction, u will 
be the unique triple in 1-plane p, but now we will have two triples in 2-plane a and 3-plane b. Thus 
the children of u are (a, a, a) and (6, b, b). This defines a tree corresponding to adding 2 k+1 — 1 and 
removing 2 k+1 — 2 triples from T. We ensure that if u = (p, a, b) is a triple at an even level, then 
p, a, b do not appear anywhere else in the tree, except at the child of u as previously described. We 
do this so that additions in one part of the tree do not clash with additions in another part and 
then the additions and deletions give rise to a partial assignment. We also insist that if u = (p, a, b) 
is a triple at an even level then C pA ^ < w. We call such a tree feasible. We considered each level 
of r to be ordered so it makes sense to talk of the rth vertex of level 21 where 1 < r < 2 l . 

We now have to show that whp there is always at least one such tree T for each i G A t . We 
take the same bottom-up approach that we did in Section 3.3. We fix t and drop the suffix t from 
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all quantities that use it. We start by estimating the number of choices for a p that can be in 
a triple (p,x,y) at level 2k. Ignoring other indices, the number of choices is again distributed 
as the binomial Bin(z^, 1 — e~ wx ) = Bin(z^, (1 — o{l))wx 2 ) where v = n — x = n — o(n). Note 
that wx 2 = K(x/n) l ~ 9k \og 6k n = o(l) and that wnx 2 = £l(n 9k ) » logn. (Here our notation 
f(n) » g(n) means that f(n)/g(n) — > oo with n). So the Chernoff bounds imply that qs we can 
choose a set P of size exactly wnx 2 / 2 = o(n), such that for each p £ P there is at least one choice 
£i,£2 such that the triple (p, ^1,^2) is good, i.e., C p ^ u ^ 2 < w. We will in fact be able to choose 2 k 
disjoint sets P^k, 1 < I < 2 k since replacing v by v — 2 k wnx 2 /2 will not significantly change the 
above calculations. (Here 2 k wnx 2 = 0(n l ~ 6k+6 i log efc+7 n) = o(n)). 

Observation 4 Each £ € X is in Bin(xv, 1 — e~ w ) good triples of the form (p € Pi,k,£,, •) and so 

(1 \ 
- ° gw J 3> log n ). 

Let 

vq = wnx 2 1 2 and = wnvf /2 for < / < fc. (5) 
The solution to this recurrence is 



(^) 2i+1 -V +1 = (nlognj^-^x^ 1 - 2 ^ 1 ^. 



Observe that v\ increases with I. Note also that if I < k — 2 then 

wu 2 < wu 2 _ 2 = 2 ^f 9k W 2 *- 1 ^ n = o(l), (6) 

w 2 n 2 x 2 /nlogn\ 20k 

wnvi > wni/Q = = 2 » logn. (7) 

2 \ x J 

We now have the basis for an inductive claim that qs if I < k — 1 and u = (p, a, b) is a triple at an 
even level 2{k — I) then there are at least v\ choices for p such that there exists a triple u = (p, a, b) 
with C u < w and a feasible tree T u with u as root and depth 21 + 1. Our analysis above has 
proved the base case of I = 0. Imagine now that we are filling in the possibilities for the rth triple 
(p,a,b) at level k — I. We fill in these possibilities level by level starting at level 2k. Imagine also 
that we have identified v\-\ choices for each of a, b. This can be an inductive assumption, so for 
example a will have to be a possible selection for the first component of the (2r — l)st triple at level 
2(fc-(*-l)). 

For a fixed p, conditional on our having selected exactly choices A, B for a, b, let p be useful 
if there is a pair (a, b) £ A x B with C p ^ a ^ < w and useless otherwise. Then, using (6), 



Pr(p is useless) < exp {—wv?_ 1 } < 1 — 



2wu 2 _ l 



It follows that the number of useful p that have not been previously selected dominates Bin(n — 
o(n), 2iw 2 _ 1 /3). Here o(n) = J2s<l ^ k s wv 2 bound the number of forbidden p's. It follows that qs 
we can choose a set of useful p's of size vi = wnv 2 _ x j2 = o{n). We can do this so that each node 
of T gets distinct choices. 

Observation 5 A fixed a is in at most Bin(ni^_i/2, 1 — e - "') good triples (p, a, b) feasible for level 
2(k — I) and so qs every a is in at most wnv\-\ such triples, see (7). 
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This completes our induction. We now apply the above to show that round t succeeds whp. 

Suppose that in the middle of a round we have added y < ax triples to T. The number of £ G X 
that can be used in a good triple (p, £, rj) at level 2k will have been reduced by y. Thus the number 
of choices for p in any triple in this level will have been reduced by at most 2 k x 2 x ax x 2wnx, 
see Observation 4. This reduces the number of choices for p in a triple at level 2(k — 1) by at most 
2 k+2 awnx 2 x wuvq = 2 k+s awnu 2 , see Observation 5. So let /U/ denote the number of choices for p 
in triples p(, ., .) at level 2(k — I) that are forbidden by choices further down the tree. We have just 
argued that \i\ < 2 k+3 awnuQ. In general we can use Observation 5 to conservatively argue that 

m < wni/i-^fn-! + 2 k+1 ax). 

It follows that for I > 2 we have 



'-<2^ + ^^<2^ + 2^a(^-Y~ 1)6k <2^± + 2 k + 2 a' ' ^ 



n n-\ n-\ n-\ \n\ognJ v x _ x " \n\ogn 

It follows that 



i/k-i v\ \n\ogn) 



We see that at the root there will still be at least (1 — 2 2k+2 a)vk-i choices for j , fc - So iq will 
get a choice of at least Bin((l — 2 2fc+2 a) 2 i/|_ 1 , 1 — e~ w ) choices for a good triple (io, jo, fco)- So the 
expected number of choices is at least 2wv 2 _ l /3, our choice of a implies this. Now wu^_ 1 = 21ogn 
and this is sufficient to ensure that whp there is always at least one choice for every iq. 



3.4.3 Final Phase 

We can execute the Main Phase so long as x > 2 k . Now assume that 1 < x < 2 k . We now 
have to add only 0(1) indices to /. This time we refresh C an 0(2 k ) number of times at an 
log dk n 

extra cost of Wf = — . „ each time we add an index. So, if successful, the cost of this round is 

TL k 

O(W t0 +w f ) = o(ffi). 

We first make an inductive assumption: We have a partial assignment I where |/| < n — 2. (The 
reader might think that we should assume |/| < n — 1, but here we use the induction hypothesis 
after one more index has temporarily been deleted from /, prior to a replacement). Assume that the 
matrix C is unconditioned and i £ I: Then we can in 0(n 2£ ) time whp find a set P of size z^_i (with 
x = 1 in definition (5)) and a collection Q p ,p G P of sets of size z^_i such that for each (p, q G P p ) 
there is an assignment P' with (i,p, q) G P' and I(P') ^ I(P) and C(P') = C(P)+C(i,p, q)+0(w f ). 
This is true for £ = 1 since we can make the changes 

+(i,p,q) - {pi,p,P3) - (qi,Q2,q) + {pi,i2,P3) + (qi,q2,h) 

where i2,h are unused 2- and 3-coordinates respectively. The number of choices for p, q are inde- 
pendent Bin(n, w). 

For the inductive step, we first refresh the matrix C. Then for each p G [n] we let I' = I — {p} 
and apply the induction hypothesis to generate v 2 _ 2 choices of assignment that add back p\ to 
We find that whp at least wv 2 _ 2 /2 = V£_i of these have C(p±, ., .) < w. Let this set be P. Now 
refresh C again and apply the same argument for each p G P to generate choices Q p for p. This 
completes the induction. 
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Now let I = k and refresh C one more time. Let P, Q p ,p £ P be the sets of size i>k-i promised 
by the above argument. We have Bin(z/|_ 1 , w) choices of j, k which can be used to add i £ I to I 
at a cost of 0(w). In expectation this is 21ogn and so we succeed whp. 

For the execution time of the algorithm we observe that to add a triple to / involves replacing 
2 k+1 — 2 triples by 2 k+l — 1 triples. This involves the construction of a tree T where for the 2 k+l — 1 
vertices that we add, we have n 2 choices to make, in the worst-case and for which the other choices 
are forced. This gives a running time of O (ji x (n 2 ) 2fc+1_1 ^ time. 

This completes the proof of Theorem 2. □ 

4 Multi-Dimensional Axial Version 
4.1 Main theorem 

Here we give our main theorem for Axial assignment. 

Theorem 3 The optimal solution value Z^ n satisfies the following: 

(a) Zf n = n(n d ' 2 ) whp for d>3. 

(b) When d = 3 there is a polynomial time algorithm that finds a solution with cost Z where 

Z = O(nlogn) whp. 

The theorem is proved in the next two sections. 



4.2 Lower bound 

It is clear that Z^ n > Z\ + Z2 + • • • + Z n d-2 where Zj is the minimum cost of the 2-dimensional 
assignment with cost matrix Aj^ = C , ji,...,i d _ 2 ,j,fc- We know that Zj > (1 — o(l))£(2) whp and the 
Zi's are independent. It follows that whp Z^ > (1 - o(l))n d ~ 2 C(2) > 3n d ~ 2 /2. 

4.3 Upper bound for d = 3 

For the upper bound we need a result of Dyer, Frieze and McDiarmid [DFM86]. We will not state 
it in full generality, instead we will tailor its statement to precisely what is needed. Suppose that 
we have a linear program 

P : Minimize c T x subject to Ax = b, x > 0. 

Here A is an m x n matrix and the cost vector c = (c±,C2, ■ ■ ■ ,c n ) is a sequence of independent 
copies of Exp(l). Let Zp denote the minimum of this linear program. Note that Zp is a random 
variable. Next let y be any feasible solution to P. 



Theorem 4 ([DFM86]) 



E(Zp) < m max y,j. (8) 

j=l,2,...,n 



Furthermore, Zp is at most 1 + o(l) times the RHS of (8), whp. 

Now consider the following greedy-type algorithm. We find a minimum 2-dimensional assignment 
for 1-plane i = 1, we then find a minimum assignment for 1-plane i = 2, consistent with choice for 
1-plane i = 1, and so on: 
Greedy 



12 



1. For i = 1, . . . , n do the following: 



• Let G = K n , n \ (Mi U M 2 U • • • Mj_i); 
. If €£(G) let A j>k = C i>jtk . 

• Let Mj be a minimum cost matching of G using edge weights A. 

The output, Mi, M 2 , . . . , M n defines a set of triples T = k) : (j, k) € Mj}. We claim that if 

Zi = A(Mi) then 

2n 

E(Z0 < — -. (9) 

n — z + 1 

For this we apply Theorem 4 to the linear program 

Minimize Aj^Xj^ subject to 

(j,k)eE(G) 

^2 x j,k = 1 > j = l,2,...,n 

fc: (j,k)eE(G) 

^2 x j:k = l, fc = l,2,...,rc 

j: (j,k)eE(G) 

Xj,k>0, j,k = l,2,...,n. 

We note that there are 2n constraints and that x^fc = l/(n — i + 1) is a feasible solution. With 
Theorem 4, this implies (9) and the upper bound in Theorem 3 for the case d = 3. 



5 Conclusions 

For the 2-dimensional random assignment problem, we know the limiting expected cost, and a 
given instance can be solved in polynomial time. As noted in the Introduction, much less is known 
about multidimensional assignment problems, and as far as we are aware, nothing was known about 
polynomial-time algorithms solving these problems well on average. For the 3-dimensional Axial 
assignment problem, the present paper is the first to prove an upper bound within a logarithmic 
factor of the obvious 0{n) lower bound (likely the true answer), doing so by analyzing a simple 
and fast greedy algorithm. For the 3-dimensional Planar assignment problem, we give the first 
upper bounds within polylogarithmic factors of the obvious 0(l/n) lower bound. One of our upper 
bounds is a trivial application of a result of Johansson, Kahn and Vu [JKV08]; we cannot really take 
credit for it. The second, however, comes from analyzing an algorithm that is the first that solves 
this problem well, on average. For any e > (logn)~ 7 , for any constant 7 > 1/2, run for "mildly 
exponential" time (polynomial time if e is constant), the algorithm delivers a solution within a 
factor 0(n £ ) of the expected minimum; we know of no other such approximation. As discussed in 
Section 2, our results do not extend to d > 4. 

We are left with open questions including these: 

PI What are the growth rates of E[Z£j and E[Z^J for d > 3? 

P2 Are there asymptotically optimal, polynomial time algorithms for solving these problems when 
d> 3? 

P3 For d > 3, are there polynomial time algorithms yielding solutions within logarithmic or 0{n e ) 
factors for Axial and Planar assignment problems (as we have given for d = 3)? 
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P4 Frieze [Fri74] gave a bilinear programming formulation of the 3-dimensional planar problem. 
There is a natural heuristic associated with this formulation (see Appendix). What are its 
asymptotic properties? 
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A Bilinear Programming Formulation 

Frieze [Fri74] re-formulated the 3-dimensional planar problem as 

n 

Minimize ^ C i: ^ k y i}j z itk subject to x,y 6 Pa 

i,j,k=l 

where Pa is the bipartite matching polyhedron Y^=i x i,j = 1 = Sj=i x i,ji f° r au 1 — J — n - 

Now denote the objective above by C(y,z). The following heuristic was used successfully in a 
practical situation [FY81]: 

1. Choose Vq,zq arbitrarily; Zq = C(yo,zo); i = 0. 

2. Repeat until Zj+i = Zj. 

• Let yi + i maximize C(y,Zi). 

• Let Zi+i maximize C(yi + ±,z). 

• Z i+ i = C(y i+1 ,z i+1 ). 

• % = i + 1. 
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